Automated unpacking of executables packed by multiple layers of arbitrary packers

ABSTRACT

The packing manager provides an automated method that allows existing AV scanning technology to be applied to detect known malware samples packed by one or more packers that are potentially proprietary. The packing manager tracks the memory areas to which an executable binary writes and executes, and so can unpack programs packed by multiple arbitrary packers without requiring reverse-engineering of the packers or any human intervention. By tracking page modification and execution of an executable binary at run time, the packing control module can detect the instant at which the program&#39;s control is first transferred to a page whose content is dynamically generated, so AV scanning can then be invoked. Thus, code cannot be executed under the packing control manager without being scanned by an AV scanner first.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention pertains in general to computer security, and more specifically to a system and method for automated unpacking of executables for malware detection.

2. Description of the Related Art

Computer systems are continually threatened by a risk of attack from malicious computer code or malware, code that enters a computer without an authorized user's knowledge and/or without an authorized user's consent, such as a virus, a worm, or a Trojan horse. Antivirus prevention/detection software can be installed on computers in an attempt to prevent malicious code attacks and to detect the presence of malicious code, by e.g., using signature detection methods. However, as malware evolves, the antivirus software too has to evolve to keep up with the latest malware.

Signature-based malware detection methods currently face a serious problem with an antivirus detection evasion tool called a “packer.” A packer is a tool that is used to take an existing piece of malware and hide or “pack” it to make it no longer detectable to most signature-based detection systems. A packer can change the byte-level appearance of a binary program without modifying its execution semantics. Because signatures used in anti-virus (AV) scanning engines are derived from the byte-level representations of malware samples, malware writers use packers to hide the malware program by changing its appearance. Even worse, malware writers can also apply different packers in different combinations to create a large number of variants of existing malware that can easily evade signature-based AV scanners.

Traditionally, AV companies have attempted to manage the packer problem by manually reverse-engineering popular packers and creating unpackers for them. Once an unpacker for a packer X is available, an AV scan engine can unpack binaries packed by X and can apply the standard scanning method on the unpacked malware. Unfortunately, there are a very large number of different packers; more than 1000 different packers are currently known. Thus, reverse-engineering each of these different packers is an impossible proposition because it is a slow and expensive process. AV companies can commonly deal with no more than 100 packers. Worst yet, packers are constantly evolving because the source code of some packers is available on the Internet for anyone to customize and tweak for evasion purposes. As a result, AV companies always lag behind the rate at which new packers are developed, and the detection rate of all AV engines on the market has fallen dramatically.

There are a number of shortcomings with each technique that has been used by AV companies to solve the packer problem. For example, one generic unpacking technique called “emulation” emulates each instruction of the unpacking code in a virtual system, but this emulation process can be very slow and requires large amounts of virtual memory, among other problems. Another generic unpacking technique called “dynamic translation” translates the program's code blocks to native code, which helps improve the speed of emulation, but this often fails to run correctly because it requires modification of the code and packers often check the integrity of the code. There are also tools for dealing with packers by manually marking potential address space as non-executable. Whenever the program wants to execute code in that address space, these tools can transfer control to a debugger for human being to take a look at it if the program is unpacked. However, these tools are not automated, are slow, intrusive, and require use of a debugger. Another method tracks memory write and memory execution by instrumenting the code, but this requires modifying the code and thus also does not work for packers which check the integrity of the code. None of these methods provides an automated, fast, non-intrusive, and effective solution to this cross-the-industry problem of dealing with malware packed by one or more arbitrary packers.

DISCLOSURE OF INVENTION

The packing manager system described here provides an automated method that allows existing AV scanning technology to be applied to detect known malware samples packed by one or more packers that are potentially proprietary. The packing manager tracks the memory areas to which an executable binary writes and executes, and so can unpack programs packed by multiple arbitrary packers without requiring reverse-engineering of the packers or any human intervention. By tracking page modification and execution of an executable binary at run time, the packing control module can detect the instant at which the program's control is first transferred to a page whose content is dynamically generated, so AV scanning can then be invoked. Thus, the packing manager is designed to avoid allowing code to be executed after unpacking without being scanned by an AV scanner first.

The packing manager includes a dynamic content detection module that detects an attempt by a program to dynamically generate content to a memory page, wherein the program is suspected to be unpacking malicious executable code to the memory page. In some embodiments, the memory page is initially marked by a marking module as executable but not writable prior to detecting the attempt to dynamically generate content to the memory page. In some embodiments, an exception handler can intercept a first exception generated when the program attempts to dynamically generate the content to the memory page set to be executable but not writable. The marking module marks the memory page to which the program attempted to dynamically generate content. For example, the page can be switched from non-writeable and executable to non-executable and writeable. A permission module allows the program to dynamically generate the content to the memory page until the program attempts to execute code stored on the memory page. An execution detection module detects an attempt by the program to execute the code stored in the memory page, the detection occurring before the execution occurs. In some embodiments, an exception handler can intercept a second exception upon detection of the attempt by the program to execute the code. An analysis module provides to a malicious code detection module the memory page for analysis for the presence of malicious code. These steps can be repeated for programs packed by multiple layers of packers, some of which may not have been previously known.

The features and advantages described in this disclosure and in the following detailed description are not all-inclusive, and particularly, many additional features and advantages will be apparent to one of ordinary skill in the relevant art in view of the drawings, specification, and claims hereof. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter, resort to the claims being necessary to determine such inventive subject matter.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 a is a high-level block diagram illustrating an example of the components of a program and a packed program.

FIG. 1 b is a high-level block diagram illustrating an example of the components a packed program illustrating changes to memory protection settings, according to one embodiment.

FIG. 1 c is a high-level block diagram illustrating an example of the components a packed program illustrating changes to memory protection settings, according to one embodiment.

FIG. 1 d is a high-level block diagram illustrating an example of the components a packed program illustrating changes to memory protection settings, according to one embodiment.

FIG. 2 is a high-level block diagram illustrating a standard computer system 200 for use with an embodiment.

FIG. 3 is a high-level block diagram illustrating the functional modules within the packing manager 120, according to one embodiment.

FIG. 4 is a flowchart illustrating steps performed by the packing manager, according to one embodiment.

The figures depict an embodiment of the present invention for purposes of illustration only. One skilled in the art will readily recognize from the following description that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles of the invention described herein.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

A “packer” is a program that can compress or encrypt an input executable (e.g. a binary file) into another executable that is different at the binary level. As an example of a typical packing process, a packer P reads in the byte sequence of the input binary O and compresses or encrypts this byte sequence into another byte sequence called O′, and adds an unpacker routine U to O′ to form the final packer output, P(O). When P(O) runs, the program's control first goes to the unpacker routine U. If AV scanning is performed on P(O) at this point, the scan will not detect the underlying malware program O as it is disguised by the packer. The unpacker routine U decompresses/decrypts O′ and restores the original program O in memory. Then, control is transferred to the main entry point of O as if the unpacking process never took place. Because P(O) is substantially different from O, the byte-level signature for O is not applicable to P(O). The AV engine fails to detect P(O) as malware, even if it behaves the same as O. Once the unpacking occurs, the execution of malware O can occur without detection as any AV scan that might have occurred would have occurred upon initial execution of P(O) and so would have already missed the malware. Malware writers can pack a piece of malware O with different packers and generate different variants such as P1(O), P2(O), etc. In addition, they can apply packing multiple times to a malware O and generate such variants as P1(P2(O)), P2(P1(O)), and so on. By applying multi-layer packing, even with a small number of packers, malware writers can easily generate a large number of variants from the same malware. If an AV scanner can successfully unpack arbitrarily packed binaries, only one signature is needed per malware; otherwise it needs to create a signature per variant per malware.

FIG. 1 a illustrates an executable program (Hello.exe 100) and illustrates that executable packed by a common packer UPX (Hello_upx.exe 102). The Hello.exe shows the standard PE header, code section, and data section that can be found in an executable program. The memory range of Hello.exe's 100 code and data sections is reserved as section UPX0 in the packed version, Hello_upx.exe 102. Data in Hello.exe's 100 code section and data section are compressed and stored in UPX1 as packed data. Unpacker code is added in UPX1 to restore Hello.exe's 100 code section and data section to UPX0 from the packed data in UPX1. The entry point of Hello_upx.exe 102 is modified to point to the unpacker code, and so the first action will be to unpack the program and restore the original binary in memory. Control will then be transferred to the original binary. The original malware is completely unpacked before control is transferred to it. Unpacking and execution are not interleaved and code pages stay in plaintext form simultaneously. The malware's post-load address space image remains the same.

Returning again to the prior example, when a packed binary P(O) runs, its unpacker routine restores the original program being packed, O, in memory at some point in time. In brief summary, the system described here functions by accurately catching this moment and invoking the AV scanner at that point so that existing AV scanning technology can be applied to P(O) as effectively as to O. In addition, the system can catch this unpacking moment in a way that is independent of the packing algorithms used in packers so this approach can be applied to P(O) regardless of the underlying packing techniques used in P.

The packing manager system described here manages the packer problem using an automated method that allows existing AV scanning technology to be applied to detect known malware samples packed by one or more packers that are potentially proprietary. Because there is no obvious difference between an unpacker routine and a binary that is packed, detecting the end of unpacking is non-trivial, and being able to do so in a way independent of the particular packer used is even more challenging. Regardless of the packer used, the unpacker routine in a packed binary at some point creates the original program being packed at run time and transfers control to it. So, during the execution of a packed binary, the code pages associated with the original program must be dynamically generated and later on executed. By catching control transfers to dynamically written pages, the system described here can catch the moment when the original program is restored.

The detection of control transfers to dynamically written pages by the packing manager system can be performed in different ways. In one embodiment, to reduce the performance overhead associated with the detection of control transfers to dynamically written pages, the packing manager system leverages the virtual memory protection hardware in modern CPUs (such as X86) to track pages that are written and executed. A brief summary of how this embodiment can function is provided here with regard to FIGS. 1 b, 1 c, and 1 d, as one example of the overall system. More specifically, before an executable binary is executed in this embodiment, the packing manager system 120 marks its memory pages as non-writable but executable. FIG. 1 b illustrates this by showing Hello_upx.exe 102 from FIG. 1 a, with its sections UPX0 and UPX1 marked by the system as non-writable (NW) but executable (X), as illustrating by the shading in FIG. 1 b. When the unpacker starts restoring the original program, it will write to non-writable pages (marked NW by the packing manager) and will cause a write access violation exception for the first write to each such page. This is illustrated in FIG. 1 c where the unpacker code of Hello_upx.exe 102 derives the original program from packed data and writes it to pages in UPX0. The virtual memory hardware automatically detects the write exception and the operating system delivers this exception to the user process. The packing manager 120 intercepts and handles each such exception by marking the associated page as writable but non-executable and resuming the execution so that the unpacker can continue restoring the original program. FIG. 1 c illustrates this by showing UPX0 of Hello_upx.exe 102 with different shading now that represents writable (W) but not executable (NX).

After the unpacker fully restores the original program, it transfers control to the entry-point (the first instruction) of the original program. The virtual memory hardware detects an execution exception because that page is writable but not executable. The operating system again delivers this exception to the user process. The packing manager 120 catches this exception and calls AV scanner to scan the address space snapshot at this point. For example, the system can dump the process's address space image for an AV scanner to scan. If the AV scanner detects malware, the packing manager 120 terminates the process. If the AV scanner does not find any match based on a virus signature scan of the dumped image, the packing manager 120 changes the pages to executable and non-writable. The program can thus continue execution without problem. FIG. 1 d illustrates this by showing that the shading of UPX0 of Hello_upx.exe 102 has changed to now be executable (X) but not writable (NW). This embodiment of the packing manager 120 can be executed in hardware, software, or both.

In this embodiment, the packing manager 120 is generic because the way it tracks the code pages of the original programs does not require any packer-specific knowledge. Moreover, it is also fully automated because no human intervention is needed. Finally, it is efficient because page status tracking can be done transparently by virtual memory hardware, and thus does not incur any software overhead. However, in other implementations of the packing manager 120, it does not rely on virtual memory hardware. For example, the packing manager 120 can instead run in an emulator that looks at each instruction to determine if any instruction has been modified. In each implementation, however, the packing manager 120 detects the control transfer to a dynamically generated page.

The packing manager system 120 can deal with multiple-layer packed binaries equally effectively without any modification. Since each layer of packer exhibits the same behavior, executing dynamically generated code, each layer will be unpacked in the reverse order of their packing sequence. If the latest unpacking round is not the last round, its output is an intermediate packed binary that needs to be unpacked. In this case, the program resumes and the packing manager 120 continues to track pages that are dynamically generated and executed. Finally the original program is restored and unpacking comes to the last round. The original program is scanned before execution, and will be prevented from executing if the AV scanner determines it to be malware.

As used herein, the term “AV scanner,” “AV program,” “AV engine,” and the like can include any type of computer security or anti-malicious code prevention or detection software located on a client computer, server, or networked computer, including e.g., signature scanners, behavior blockers, intrusion detection scanners, etc. The term, “unpacking,” as used here includes unpacking, unencrypting, descrambling, de-obfuscating, etc. An “exception,” as used herein is a signal generated when a program attempts to write to a non-writable page or to execute a non-executable page, or it can be a hardware interrupt, a software signal, etc.

FIG. 2 is a high-level block diagram illustrating a functional view of a typical computer system 200 for storing and executing the packing manager system, according to one embodiment of the present invention. This computer system 200 can act as a client computer 110, as shown in FIG. 1. However, one or more of the components of the computer system 200 may be missing or modified in the client computer 110. Illustrated is a processor 202 coupled to a bus 204. Also coupled to the bus 204 are a memory 206, a storage device 208, a keyboard 210, a graphics adapter 212, a pointing device 214, and a network adapter 216. A display 218 is coupled to the graphics adapter 212.

The processor 202 may be any general-purpose processor such as an INTEL x86, SUN MICROSYSTEMS SPARC, or POWERPC compatible-CPU, or the processor 202 may also be a custom-built processor. The memory 206 may be, for example, firmware, read-only memory (ROM), non-volatile random access memory (NVRAM), and/or RAM, virtual memory, memory-mapped or disk-based memory storage, etc., and holds instructions and data used by the processor 202. The storage device 208 is, in one embodiment, a hard disk drive but can also be any other device capable of storing data, such as a writeable compact disk (CD) or DVD, and/or a solid-state memory device. The pointing device 214 may be a mouse, track ball, or other type of pointing device, and is used in combination with the keyboard 210 to input data into the computer system 200. The graphics adapter 212 displays images and other information on the display 218. The network adapter 216 couples the client computer 110 with the Internet 112.

As is known in the art, the computer system 200 is adapted to execute computer program modules for providing functionality described herein. In this description, the term “module” or “manager” refers to computer program logic for providing the specified functionality. A module/manager can be implemented in hardware, firmware, and/or software. Where the any of the modules/managers described herein are implemented as software, the module/manager can be implemented as a standalone program, but can also be implemented in other ways, for example as part of a larger program, as a plurality of separate programs, or as one or more statically or dynamically linked libraries. It will be understood that the modules/managers described herein represent one embodiment of the present invention. Certain embodiments may include other modules/managers. In addition, the embodiments may lack modules described herein and/or distribute the described functionality among the modules/managers in a different manner. Additionally, the functionalities attributed to more than one module/manager can be incorporated into a single module. In one embodiment of the present invention, the modules/managers are stored on the storage device 208, loaded into the memory 206, and executed by the processor 202. Alternatively, hardware or software modules/managers may be stored elsewhere within the computer system 200. Similarly, a computer program product comprising a computer-readable medium (e.g., a CD-ROM, a tape, a DVD, memory, flash memory, etc.) containing computer program code for performing functionalities described here is contemplated.

FIG. 3 is a high-level block diagram illustrating the functional modules within the packing manager 120, according to one embodiment of the present invention. The packing manager 120, in the embodiment illustrated in FIG. 3, includes a marking module 302, a dynamic content detection module 304 having an exception handler 306, a permission module 308, an execution detection module 310 having an exception handler 312, an analysis module 314, and a memory manager 320. FIG. 3 also illustrates a malicious code detection module 340, as described below. Those of skill in the art will recognize that other embodiments can have different and/or additional modules than those shown in FIG. 3 and the other figures. Likewise, the functionalities can be distributed among the modules in a manner different than described herein. When describing the embodiments of the invention below, the examples frequently focus on the virtual memory hardware embodiment as an example of how the packing manager 120 can function. However, throughout the specification, the examples used also apply to the other embodiments of the packing manager 120, as previously described.

In some embodiments, the packing manager 120 logic is first injected into a process, and the packing manager state is initialized. This injection can occur in various ways. For example, the packing manager 120 logic can be implemented as a dynamic-link library (DLL). When the process is created, the packing manager DLL is injected into the process. There are several well-known technologies to achieve this goal. The advantage of this approach is that it is easy to make the code reliable and portable to different versions of operating systems. Another approach is to implement the packing manager 120 logic in an operating system kernel, such as WINDOWS® Kernel. Thus, any process will be automatically tracked by the kernel packing manager 120 code. The advantage of this approach is that the packing manager 120 can be stealthier. One disadvantage regarding the WINDOWS® Operating System, for example, is that it is hard to port to WINDOWS® for x64 platform because of the patchguard.

Once the packing manager 120 logic is injected, it can perform various processes. For example, the marking module 302 of the packing manager 120 can perform an initialization that includes first contacting the memory manager 320 to find the address space where the program or target process is loaded and recording the original protection setting of the memory pages in this address space. As used herein, the term “memory page” is a block of memory as defined by a memory manager, a memory management unit in a processor, a virtual memory management system, a virtual machine memory manager, or another hardware or software predefined region of memory. The memory manager can be implemented in hardware, software, or both. In some embodiments, the memory manager is virtual memory protection hardware, such as that present in modern CPUs (e.g., X86), which can be used to track pages that are written to and executed.

In some embodiments, the marking module 302 then initially marks these memory pages where the program is loaded as executable but not writable prior to any detection by the module 302 (as also illustrated in FIG. 1 b). If the program is malware packed by a packer, it will first need to unpack the malware by writing it to one or more of the memory pages. However, because the marking module 302 initialized the memory pages to be non-writable, when the unpacker routine starts restoring the original program, it will write to the non-writable pages, as marked by the marking module 302, and it will thus cause a write access violation exception for the first write to any such marked page. If the page(s) are already marked not writable, but executable, this marking by module 302 can be skipped.

The dynamic content detection module 304 of the packing manager 120, for example, can detect an attempt by a program to dynamically generate content to a memory page, wherein the program is suspected to be unpacking malicious executable code to the memory page. The module 304 can perform the detection in various manners. In one embodiment, the module 304 includes an exception handler 306 that can intercept and handle a first exception generated by the memory manager 320 when the program attempts to dynamically generate content to the memory page set to be executable but not writable by the marking module 302. When the write access violation described above occurs because the program wrote to a non-writable page, the exception handler 306 catches an exception generated by the memory manager 320 so the packing manager 120 can intercept and handle the exception.

The marking module 302 marks the memory page to which the program attempted to dynamically generate content, as detected by the module 304. In one embodiment, the marking module 302 handles the exception generated by the exception handler 306 by doing this marking of the memory page. More specifically, in this embodiment, the marking module 302 handles the exception by marking the memory page to which the program attempted to write as writable but not executable. Thus, the program can continue its writing because the page is now marked writable again without the program even necessarily realizing that there was any sort of exception generated. If the program is packed malware, the unpacker can continue its unpacking of the original malware program. In some embodiments, the marking module 302 marks the memory page by tracking or recording the page to which the program attempted to dynamically generate content.

The permission module 308 allows the program to dynamically generate the content to the memory page until the program attempts to execute code stored on the memory page. As explained above, the marking module 302 switches each page to which the program is attempting to write from non-writable but executable to writable but not executable upon the first attempt by the program to write to that page. Once this occurs, the module 308 allows the program to continue its writing to the one or more memory pages. For the writeable, non-executable memory pages, no further action need by taken by the packing manager 120 until execution occurs because the module 308 will ensure that the program can continue to dynamically generate content for writeable, non-executable pages until the program makes an attempt at execution. Thus, the packing manager 120 does not need to interfere with or otherwise slow the dynamic generation of content by the program on the writeable, non-executable memory pages. The packing manager 120 further does not need to waste resources monitoring this dynamic generation of content since the memory pages to which the writing is occurring are marked non-executable. In this manner, the packing manager 120 has ensured that, whatever happens in the meantime, the program cannot execute dynamically generated code without the packing manager 120 first being notified. In some embodiments, if the program attempts to write to another memory page, and exception is again generated by the memory manager 320 since the page is still marked as non-writeable but executable (as marked initially).

The execution detection module 310 detects an attempt by the program to execute the code stored in the memory page, the detection occurring before the execution occurs. The module 310 can perform the detection in various manners. In one embodiment, the module 310 includes an exception handler 312 for intercepting and handling a second exception generated by the memory manager 320 upon detection of the attempt by the program to execute the code. If the program is packed malware, after the unpacker fully restores the original program, it will transfer control to the program's entry-point. However, when the program attempts to execute, this will cause an execution access violation exception inside the memory manager 320 because the pages are marked writable but not executable. The exception generated is caught by the exception handler 312. Specifically, the page is marked non-executable because its content is dynamically generated, and the dynamic generation was detected originally by the detection module 304, and then the page was marked as non-executable by the marking module 302.

The exception handler 312 can work in the same manner as the exception handler 306, and in some embodiments, they are both included in one exception handler. The exception handler can also be a standalone module. These exception handlers thus can enforce write and execution in a mutually exclusive manner so that any execution of dynamically generated code will be caught. The following is an example of pseudo code that might be implemented in the exception handlers 306, 312.

{

if write exception

-   -   change page to be writable but not executable

if execution exception

-   -   dump the memory image of the target process     -   scan the dumped file     -   if malware         -   terminate the process     -   else         -   change the page to be executable

if others

-   -   pass the exception to the program         }

The analysis module 314 provides to a malicious code detection module 340 the memory page for analysis for the presence of malicious code. In providing the memory page, for example, the analysis module 314 can provide a descriptor such as the memory address location of the page or a pointer to the page, a dump or copy of the contents of the page, execute a callback function on the memory page from the malicious code detection module, an in-memory portable executable (PE) file, etc. For example, the analysis module 314 can present a file that is already in memory to module 340, and since it is already in memory, there is no need to dump it out to disk for module 340 to read from disk to memory again. Thus, the file is presented in an efficient and convenient manner to module 340.

The malicious code detection module 340 can be an AV scanner for scanning the program for the presence of malicious code. In FIG. 3, the malicious code detection module 340 is shown as separate from the packing manager 120, but it could also be included partially or entirely within the packing manager 120. In some embodiments, at each execution access violation exception, the analysis module 314 dumps the process's address space image for an AV scanner to scan for malware. In some embodiments, the AV scanner uses standard virus signature detection methods known in the art to scan the program to determine if a known virus signature is contained in the program. If the AV scanner does not find any match in the dumped image, the program is permitted to continue its execution without further interruption. However, if the AV scanner/detection module 340 detects malware, then appropriate action can be taken to prevent this malware from executing and infecting or otherwise harming the computer. For example, the malware can be quarantined for further examination, the user can be notified that malware was detected, etc. In some embodiments, the module 314 provides only the one or more memory pages to which the program dynamically generated content and then attempted to execute.

In some embodiments, the program is packed by multiple layers of packers, and these can be different, arbitrary packers, some of which may be proprietary packers or packers that were not previously known or seen before by computer security companies (e.g., for which antivirus signatures are not currently available). For each layer of packing, the modules described above can repeat the steps. For example, a first layer of unpacking may only unpack a program that is still packed by one or more further layers of packers. Thus, when there is an attempt to execute this program that is still packed (e.g., control is first transferred to the pages whose content are dynamically generated), the analysis module 314 can provide these pages for scanning. Since the original program is still packed by one or more packers, the scanner may or may not detect that there is malware contained, depending on whether or not there is a signature available for that packer that was unpacked. If there is determined not to be malicious code in that layer that was unpacked, the unpacking process can continue, resulting in another layer being unpacked. Again, there is detection of the end of the unpacking, and before execution of the unpacked code occurs a scan can be performed. These steps performed by the modules above can be repeated until eventually the last layer of unpacking occurs, and the original program is unpacked. The packing manager 120 again detects the point in the unpacking process when the program attempts to execute the memory page with dynamically generated content. If this original program contains malicious code, the analysis of the memory page performed by the malicious code detection module will likely detect it because the memory page now contains unpacked malicious code, and thus the malicious code will be caught before execution and can be prevented from harming the computer.

Referring now to FIG. 4, there is shown a flowchart illustrating the operation of the packing manager 120, according to some embodiments of the present invention. Different embodiments of the packing manager 120 may perform the illustrated steps in different orders, omit certain steps, and/or perform additional steps not shown in FIG. 4.

As shown in FIG. 4, in some embodiments, the packing manager 120 initially marks 402 the memory page as executable but not writable. As explained above, the packing manager 120 can find the address space where the program or target process is loaded and can record the original protection setting of the memory pages in this address space. If the memory pages are initially set to executable and writable, the packing manager can change these protection settings by marking each of the memory pages as executable but not writable.

The packing manager 120 detects 404 an attempt by the program to dynamically generate content to a memory page. The packing manager 120 can perform its overall method on any program on a computer, but especially if program is suspected to be unpacking malicious executable code to the memory page. The packing manager 120 performs this detection 404, in some embodiments, when a first exception generated. Specifically, an exception or interrupt can be generated when the program attempts to dynamically generate the content to the memory page since the page was previously set to be executable but not writable by the packing manager 120 (or was already initially set to be non-writable, in which case the packing manager 120 did not need to reset the page).

The packing manager 120 marks 406 the memory page to which the program attempted to dynamically generate content. In some embodiments, this marking occurs in response to the write access violation that occurs when the program attempts to write to the non-writable pages. As explained above, an exception can be generated by a memory manager, and the packing manager 120 intercepts and handles the exception. One way in which the packing manager 120 handles this exception is by marking 406 the memory page as writeable but not executable. Specifically, the packing manager 120 can switch the page that is currently marked as executable but not writeable to writeable but not executable. The packing manager 120 can thus allow 408 the program to dynamically generate the content to the memory page until the program attempts to execute code stored on the memory page. The program can now continue to write to the page without causing further write access violations since the page is now writeable. However, since the page was switched to non-executable, the program will not generate any further access violations until it attempts to execute. If the program attempts to write to another page, steps 404, 406, and 408 can be repeated.

The packing manager 120 then detects 410 an attempt by the program to execute the code stored in the memory page. This detection occurs before the execution occurs, thus allowing the packing manager 120 to be notified when the program first tries to execute the dynamically generated content. The packing manager 120 performs this detection 410, in some embodiments, when a second exception is generated by a memory manager upon detection of the attempt by the program to execute the code. The packing manager 120 can again intercept the exception and take action. The packing manager 120 provides 412 to a malicious code detection module the memory page for analysis for the presence of malicious code. For example, the packing manager 120 can dump the process's address space image for the malicious code detection module to scan or otherwise perform an analysis for malware (e.g., using standard virus signature detection methods known in the art). If the malicious code detection module does not find any match in the dumped image, the program is permitted to continue its execution without further interruption. However, if the malicious code detection module detects malicious code in the program, then appropriate action can be taken to prevent this malicious code from harming the computer.

As explained above, some programs may be packed by more than one layer of packers. In some cases, there may be multiple layers of arbitrary packers of different types, some or all of which may not have been previously known (e.g., no signatures available). In some cases, the same packer may be applied more than once. Steps 402-412 can be repeated for each layer of packer, as explained above. When the original program is finally unpacked, the packing manager 120 will detect the point in time between the end of unpacking and the execution of the original program. Thus, if this program contains malicious code, this malicious code can be detected when the program is provided 412 for analysis. In other words, the packing manager 120 can refuse to execute any dynamically generated code without scanning first. Thus, at the finish of each layer of unpacking, but before execution, a scan will occur. In some embodiments, the scan occurs immediately after unpacking is concluded. However, it is also contemplated that additional steps could occur after unpacking ends but before execution, so the scanning can occur after these steps but still before execution.

Numerous specific details have been set forth herein to provide a thorough understanding of the embodiments. It will be understood by those skilled in the art, however, that the embodiments may be practiced without these specific details. In other instances, well-known operations, components and circuits have not been described in detail so as not to obscure the embodiments. It can be appreciated that the specific structural and functional details disclosed herein may be representative and do not necessarily limit the scope of the embodiments.

Unless specifically stated otherwise, it may be appreciated that terms such as “processing,” “computing,” “calculating,” “determining,” or the like, refer to the action and/or processes of a computer or computing system, or similar electronic computing device, that manipulates and/or transforms data represented as physical quantities (e.g., electronic) within the computing system's registers and/or memories into other data similarly represented as physical quantities within the computing system's memories, registers or other such information storage, transmission or display devices. The embodiments are not limited in this context.

As used herein any reference to “one embodiment” or “an embodiment” or “some embodiments” means that a particular element, feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment.

As will be understood by those familiar with the art, the invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. Likewise, the particular naming and division of the modules, managers, features, attributes, methodologies and other aspects are not mandatory or significant, and the mechanisms that implement the invention or its features may have different names, divisions and/or formats. Furthermore, as will be apparent to one of ordinary skill in the relevant art, the modules, managers, features, attributes, methodologies and other aspects of the invention can be implemented as software, hardware, firmware or any combination of the three. Of course, wherever a component of the present invention is implemented as software, the component can be implemented as a script, as a standalone program, as part of a larger program, as a plurality of separate scripts and/or programs, as a statically or dynamically linked library, as a kernel loadable module, as a device driver, and/or in every and any other way known now or in the future to those of skill in the art of computer programming. Additionally, the present invention is in no way limited to implementation in any specific programming language, or for any specific operating system or environment. Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting, of the scope of the invention, which is set forth in the following claims. 

1. A computer program product for automated detection of dynamically unpacked malicious code, the computer program product comprising a non-transitory computer-readable medium containing computer program code for performing the method comprising: detecting an attempt by a program to dynamically generate content to a memory page, wherein the program is suspected to be unpacking malicious executable code to the memory page; marking the memory page to which the program attempted to dynamically generate content; allowing the program to dynamically generate the content to the memory page until the program attempts to execute code stored on the memory page; detecting an attempt by the program to execute the code stored in the memory page, the detection occurring before the execution occurs; and providing to a malicious code detection module the memory page for analysis for the presence of malicious code.
 2. The computer program product of claim 1, further comprising initially marking the memory page as executable but not writable prior to detecting the attempt to dynamically generate content to the memory page, wherein the detection of the attempt to dynamically generate content further comprises intercepting a first exception generated by a memory manager when the program attempts to dynamically generate the content to the memory page set to be executable but not writable.
 3. The computer program product of claim 1, wherein the marking comprises marking the memory page as writable but not executable, and wherein the detection of the attempt execute the program further comprises intercepting a second exception generated by the memory manager when the program attempts to execute the code written to the memory page set to be writable but not executable.
 4. The computer program product of claim 3, wherein the memory manager is implemented in hardware.
 5. The computer program product of claim 1, wherein the program is packed by multiple layers of packers, and wherein the method further comprises repeating the steps for each layer of unpacking that occurs.
 6. The computer program product of claim 5, wherein one or more of the multiple layers of packers are arbitrary packers for which antivirus signatures are not available.
 7. The computer program product of claim 1, wherein the program is packed by a packer and is being unpacked by an unpacking process, wherein detection of the attempt to execute further comprises detecting a point in the unpacking process after unpacking has concluded but before the program executes the memory page with dynamically generated content, and wherein the providing step further comprises providing the unpacked program to an antivirus scanner to scan for malicious code.
 8. A computer-implemented method of automated detection of dynamically unpacked malicious code, the method comprising: detecting an attempt by a program to dynamically generate content to a memory page, wherein the program is suspected to be unpacking malicious executable code to the memory page; marking the memory page to which the program attempted to dynamically generate content; allowing the program to dynamically generate the content to the memory page until the program attempts to execute code stored on the memory page; detecting an attempt by the program to execute the code stored in the memory page, the detection occurring before the execution occurs; and providing to a malicious code detection module the memory page for analysis for the presence of malicious code.
 9. The method of claim 8, further comprising initially marking the memory page as executable but not writable prior to detecting the attempt to dynamically generate content to the memory page, wherein the detection of the attempt to dynamically generate content further comprises intercepting a first exception generated by a memory manager when the program attempts to dynamically generate the content to the memory page set to be executable but not writable.
 10. The method of claim 9, wherein the marking comprises marking the memory page as writable but not executable, and wherein the detection of the attempt execute the program further comprises intercepting a second exception generated by the memory manager when the program attempts to execute the code written to the memory page set to be writable but not executable.
 11. The method of claim 10, wherein the memory manager is implemented in hardware.
 12. The method of claim 8, wherein the program is packed by multiple layers of packers, and wherein the method further comprises repeating the steps for each layer of unpacking that occurs.
 13. The method of claim 8, wherein the program is packed by a packer and is being unpacked by an unpacking process, wherein detection of the attempt to execute further comprises detecting a point in the unpacking process after unpacking has concluded but before the program executes the memory page with dynamically generated content, and wherein the providing step further comprises providing the unpacked program to an antivirus scanner to scan for malicious code.
 14. A computer system automated detection of dynamically unpacked malicious code, the system comprising: a dynamic content detection module for detecting an attempt by a program to dynamically generate content to a memory page, wherein the program is suspected to be unpacking malicious executable code to the memory page; a marking module for marking the memory page to which the program attempted to dynamically generate content; a permission module for allowing the program to dynamically generate the content to the memory page until the program attempts to execute code stored on the memory page; an execution detection module detecting an attempt by the program to execute the code stored in the memory page, the detection occurring before the execution occurs; and an analysis module for providing to a malicious code detection module the memory page for analysis for the presence of malicious code.
 15. The system of claim 14, further comprising the marking module initially marking the memory page as executable but not writable prior to detecting the attempt to dynamically generate content to the memory page, wherein the execution detection module further comprises an exception handler for intercepting a first exception generated by a memory manager when the program attempts to dynamically generate the content to the memory page set to be executable but not writable.
 16. The system of claim 15, wherein the marking module is further configured for marking the memory page as writable but not executable, and wherein execution detection module further comprises an exception handler for intercepting a second exception generated by the memory manager when the program attempts to execute the code written to the memory page set to be writable but not executable.
 17. The system of claim 16, wherein the memory manager is implemented in hardware.
 18. The system of claim 16, wherein the memory manager is implemented in software.
 19. The system of claim 14, wherein the program is packed by multiple layers of packers, and wherein the modules are further configured to repeat the steps for each layer of unpacking that occurs.
 20. The system of claim 14, wherein the program is packed by a packer and is being unpacked by an unpacking process, wherein the execution detection module is further configured for detecting a point in the unpacking process after unpacking has concluded but before the program executes the memory page with dynamically generated content, and wherein the analysis module is further configured for providing the unpacked program to an antivirus scanner to scan for malicious code. 